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In the Rosenbluth and Rosenbluth method of computing polymer configurations, the configurations 
are weighted in order to remove bias of the estimated parameters of the configurations. This weighting 
method is investigated and generalized for importance sampling and Boltzmann factors. The estimates 
are found to be unbiased in the limit for an infinite sample of configurations, but to have a bias for a 
finite sample. The standard deviations of the estimates are also derived. 
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Polymer molecules in solution have been simulated by non-self-intersecting random walks 
on a lattice by many investigators 11-7J l . The procedure is to calculate many non-self-intersecting 
random walks of a given number of steps or segments, n, and calculate a parameter of each walk, 
such as the square of the end-to-end distance, r 2 . The values of the parameter are then averaged. 
If the generated walks are a random sample of all non-self-intersecting walks of length n, then 
the average value of the parameter, such as (r 2 ), will be an estimate of the average value of the 
parameter over all non-self-intersecting random walks of length n. Of course, for very small values 
of /i, all possible walks may be generated and the average value of a parameter of the walks directly 
calculated. However, for large values of n, the number of possible walks becomes too large to gener- 
ate even on a computer, so it is possible to generate only a sample of the possible walks. 

The direct method of generating a sample of non-self-intersecting random walks is to generate 
a sample of random walks and discard those that intersect themselves. However, for large n this 
method is impractical because almost all generated random walks will be self-intersecting and 
must be discarded. Three practical methods, chain enrichment [1], dimerization [6] and the method 
of Rosenbluth and Rosenbluth [2] have been used to generate non-self-intersecting random walks 
for Monte Carlo studies of polymer configurations. This paper investigates the accuracy and bias 
of estimates of the parameters of walks generated by the method of Rosenbluth and Rosenbluth. 
Rosenbluth and Rosenbluth gave an intuitive justification, but no proof that their method is un- 
biased. Formulas for the bias and variance of the estimates are given in the appendix of reference 
[4], but without detailed derivations. Also, importance sampling and Boltzmann factors that are 
used in a current Monte Carlo study [11] are not considered in [4]. This paper gives complete deriva- 
tions of the bias and variance of the estimates and generalizes the derivations for importance 
sampling and Boltzmann factors. 

Although random walks are usually generated on three dimensional lattices for large values 
of n, the methods of generation will be illustrated for n = 4 on a square lattice, following Rosen- 
bluth and Rosenbluth [2]. All 25 nonintersecting random walks are shown in figure 1. 

In the Rosenbluth and Rosenbluth [2] method of generating random walks, only steps for 
which the walk does not intersect itself are taken. Thus, in the walk shown in figure 2, either 



1 Figures in brackets indicate the literature references at the end of this paper. 
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FIGURE I. All nonintersecting walks of 4 steps on a square lattice. 
The direction of the first step is fixed. 
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Figure 2. 77iree possible steps for a walk on a square lattice. Step C 
is not allowed for a nonintersecting walk. 

step A or B would be chosen, each with probability i. Although there is still attrition of the walks 
due to trapping [3], this method allows efficient generation of long walks. However, different walks 
are generated with different probabilities. In the example of figure 1, walks 1 to 21 are each gen- 
erated with probability (i) 3 while walks 22 to 25 are each generated with probability of (i) 2 J. 

Consider a very large number, m, of random walks generated by this method. The average 
value of r 2 over the sample will be 



1 



<r 2 > = -2^X^i = 6 - 74 



(1) 



where r 2 and Pj are the square end-to-end distance and probability for each of the 25 walks of 
figure 1. However, the correct average [2] obtained by averaging r 2 over the 25 configurations is 
7.04. This method is seen to produce compact walks with too great a probability, so simple averages 
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of the walks produce incorrect, biased, results. In order to remove the bias, Rosenbluth and 
Rosenbluth [2] weighted each configuration to give, for any parameter v, 



(i)- 



2 VkWk 

k=\ 

m 
2 W k 



(2) 



where the weights, Wk, are the reciprocals of the probabilities of the walks. They gave an intuitive 
argument, but no proof, for the weighting procedure. In the above example, Mte = 3 3 for walks 1 to 
21 and w k = 3 2 2 for walks 22 to 25. 

Also, the number of walks, T, of n steps was estimated by 



1 m 

T= — T w k . 
m ~ 



(3) 



The carets placed over v and T indicate that eqs (2) and (3) give estimates for (v) and T rather than 
their true values. 

We first prove that eq (3) gives a correct estimate. A complication arises due to trapping [2| 
of the walks, when the walk cannot be continued to n steps. In this case the walk is terminated 
and its weighting factor is defined to be zero. Let Wt be the number of trapped configuration of 
walks of less than n steps, so that every walk gives either one ol the T ra-step walks or one of the 
Wt trapped configurations. The expectation [9] of w is given by the summation of uhP\ over all 
walks where P\ is the probability of the walk. That is: 



1 



w T 



E(T)=E(w) = ^ Wi — + 2 0Pt = T. 



W{ 



(4) 



Thus eq (3) estimates the number of walks. This proof follows the method of Lehman and Weiss [8|. 
The variance and standard deviation of T are also of interest. The variance of w is 



Vairw = E(w 2 )-(Ew) 2 



(5) 



T 1 

= y w 2 — - t 2 



=2>-r. 



(6) 



Because T is an average of m independent values of w, its variance is [9] 



Var T= — Var w = 
m 



2>-p 



m 



(7) 



and its standard deviation is 



V T -I1/2 / 

<r(T)= 2>i-H / V^ 



(8) 
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The investigation of the estimate given by eq (2) that was outlined in reference [10] is now given in 

detail as follows: 

For a parameter v, define: 

"1 m 

R=-y Vk w k (9) 

k=l 

so that 

(v)=Rlf. (10) 

Let E(R) and E(T) be the expectation values of R and T, with 

E(f) = T. (11) 

The expectation of R is given by the expectation of v w as 

E{R) = y j v i w i -=y j v i = T(v) (12) 

i=l i=l 

As the sample size m tends to infinity, R and T tend to their expectation values and (v) tends to 
the average of v over the walks. Thus, eq (2) is asymptotically unbiased, i.e., it is unbiased in the 
limit of infinite sample size. 

To investigate eq (2) for large but finite values of m, (v ) is expanded about E(R) and E(T) 
to give 

„ E(R) 1 E(R) 1 - 

E(T) E(T) [E(T)f [E(T)]z 

E(R) 

+ — -*-t-[f-E(f)] 2 + higher order terms. (13) 
[E(T)f 

We now take the expectation value of (v ). The first term gives (v) and the expectation value 
of the two first order terms vanish. Expansion and substituting for E(R) andE(T) from eqs (12) 
and (11) in eq (13) gives the approximation 

E((i)) = (v) + [(v) E(f*)-E(RT)]IT* (14) 

By the definition of the variance, 

Varf = £(f 2 )-[£(f)] 2 ' (15) 

Substituting eqs 4 and 7 gives 

1 T m — \ 

E(T*)=-Ywi+ T 2 (16) 

i=i 

To evaluate E(RT), we substitute from eqs (3) and (9) and rearrange terms to give 
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J / m m \ 

E(RT)=—E[2v k w k 2wk) 

1 T 
The first term on the right hand side gives — V v- x w\. The second term is a summation of 

m (;n — 1) quantities, and because v k w k and wi are independent, the expectation value of their 
product is the product of their expectation values, so 

(m m \ 

]T ^ v k w k wi) = m(m — \)E(v k w k )E{wi) 
i* k /t=i / 

= m(m-l)E(R)E(w) (18) 

Substituting eqs (16), (17), and (18) in eq (14) finally gives 

E[(v)] = (v)-j^y (vi-(v))wi (19) 

This gives the approximate bias for the average given by eq (2). That is, (v) calculated from a 
sample of m walks will, on the average, differ from the true average (v) by the right-hand term. 
However, for increasing sample size m, the bias will approach zero, so the average is asymptotically 
unbiased. 

For the simple example shown in figure 1, the summation was evaluated to give 

£[<r 2 >] = <r 2 >-0.233/m 

Thus, for a sample size of 100 walks, the mean value of r 2 calculated by eq (2) would be on the 
average too low by about only 0.002 lattice units squared. For longer walks on three-dimensional 
lattices, it is not practical to calculate the bias from eq (19). Methods of estimating the bias for 
these cases will be given in a later publication [11]. 

Using the formulas of Ku [12], the approximate standard deviation of (v) (from its average 
value, not from the true value (v)) is derived: 



<r((i))- 



' 1 T "1 1/2 / r— 

j^S (^-<">) 2 ^J / ^ (20) 



The standard deviation decreases as the square root of the sample size. Therefore, for suffi- 
ciently large sample size, the bias will be much smaller than the standard deviation, so the bias 
may be neglected. For the example of figure 1, 

cr(</ :2 )) = 3.58/V^ 

For m=100, the standard deviation of this estimate is about 0.36, which is much larger than 
the bias of the estimate. 

To apply this method to long walks on various lattices, weighting factors, w, must be computed 
for each walk as the walk is generated. Let q be the maximum number of choices for a step of a 
walk on a lattice, i.e., one less than the coordination number of the lattice. Thus, g=3, 5 and 11 
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for square, simple cubic, and face-centered cubic lattices, respectively. Also, when two non- 
adjacent sites of a walk lie within one lattice distance of each other, they are said to form a contact. 
Thus in figure 1, walks 4, 7, 18, 19, and 22 to 25 contain a contact. Now let the ith site of a walk 
form d contacts, so the next step of the walk has q — Ct possible directions. One of these direc- 
tions is chosen randomly, so it is chosen with a probability of ~ Therefore, the probability 

q — Ci 



n-\ 



1 



of generating a particular walk of n steps is TT rr and w= IT (q — Ct). 

During generation of the walks, some of the walks are trapped, i.e., all surrounding sites are 
occupied so the walk cannot be continued. Because wt is defined to be zero for these walks, they 
should not be used in the calculation of eq (2). 

Average for Walks With Boltzmann Factors 

In calculations for polymer walks with nearest neighbor interaction energies, the averages 
over parameters of the walks multiplied by Boltzmann factors ai = exp(— pi ejkt) are desired, where 
Pi is the number of contacts of the ith walk, e is the energy per contact, k is the Boltzmann constant, 
and t is the temperature. That is, the quantities 

S=2 a, (21) 

i=l 

and 

(v)=j^ amlS (22) 

i=\ 

are to be estimated. The derivation follows as previously. Thus corresponding to eq (4). 

E{wa)=Y, w i a i~ = s (23) 



so an estimate of S from a sample of m walks is 



1 m 

S = - 2 w k a k (24) 



to estimate (v ) we propose 



2 v k(*kW k 

m 
2 a kW k 



Cv) = k ^r- (25) 



The derivation is the same as previously with aw replacing w and S replacing T (except for limits 
of the summations). Equation (19) then becomes 

E[Cv)] = (v) - -\- f (vi-iv^atwi (26) 



and eq (20) becomes 



o-«f» = 



1 T V 1 ' 2 / r- 

-j (vi-ivWatwj /V^i (27) 
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Importance Sampling 

In the described method, all allowable steps from a given site have been given equal proba- 
bilities. However, walks are sometimes performed in which the allowable steps are taken with dif- 
ferent probabilities. In this way, many walks for which a desired parameter is large (important 
walks) will be generated. The average of the parameter over this sample of generated walks will 
be calculated with a higher accuracy. This technique is called Importance Sampling [13]. One 
method used gives twice the probability to allowable steps that point toward the origin than to other 
steps. Thus, in figure 3a on the rectangular lattice, step A would be taken with a probability of £ 
and steps 3 and C would be taken with a probability of i each. In figure 3b, step A would be taken 
with a probability § and step B with probability i. With this method many more coiled walks con- 
taining large number of contacts are generated than when all steps have equal probability. 

For walks with large nearest-neighbor interaction energies of attraction, the Boltzmann fac- 
tors «i are large for walks containing many contacts. Then, for a parameter v such as end-to-end 
distance, many walks for which VkOt-k is large are generated by the above method so that more ac- 
curate values for the average of v by eq (25) should be obtained than for walks generated in the 
ordinary way. This method of generating walks will be used in reference [11]. 

Many other methods of choosing unequal probabilities for the steps may be used. For another 
example, the step in the same direction as the preceding step may be given a higher probability 
than the other steps. For any such method, weighting factors ivt may be calculated so that eq (2) 
will apply. We now derive the weighting factors for a general method of chain generation with 
Importance Sampling. 

For any step in a walk, let there be s allowable steps. Let a multiplicity mu{k= 1 to s) be as- 
signed to each step proportional to the probability of the step. For example, let steps toward the 
start of the walk be given twice the probability of other steps. Thus, in figure 3a, 5 = 3, step A has 
a multiplicity of 2 and steps B and C each have a multiplicity of 1; in figure 3b, 5 = 2, and steps 
A and B have multiplicities of 2 and 1 respectively. In general, the probability of a step is 



2 m 



(28) 



where m c is the multiplicity of the chosen step. The probability of a given N step walk is then the 
product of the factors (28) for all steps of the walk and the weighting factor is the reciprocal of the 
probability, or the products of the factors 



over all steps of the walk. 



2>* 



m c 



(29) 
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FIGURE 3. Typical configurations on a square lattice illustrating 
assignment of multiplicities to allowed steps. 
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Conclusions 

The biases and standard deviations for parameters of walks with excluded volume generated 
by the method of Rosenbluth and Rosenbluth [2] have been derived and extended to the cases of im- 
portance sampling and Boltzmann factors. The direct calculation of the biases and standard devia- 
tions involve summations over all walks so is generally not feasible. However, the formulas will be 
used to estimate the biases and standard deviations from Monte Carlo calculations in a later paper. 
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